1 TCACAGAACA TGTCCAACAA CAGCCCCGAG TATGCTTTGG TTTTCACCAT 
51 CTCGGGTGCT ATGGCCACCA TGGTCTCCAG TGGCCTGGGT GCTGCCTGTG 
101 GCATGGCCAA GAATGGCACC GGCATCATGG CCATGTCTGT CATGTGGCCA 
151 GAGCTGATCC ACATGAAGTC CATCATCCCA GTGGTCATGG CTGGTATCAT 
201 CACCATCTAT GGCCTAGTGG CGGCTGTCCC CCCTGCCAAC TGCCTGAATG 
251 ATGACAACAG TCTCTATAGC AGTTTCCTCC AGCTGGGCGC TGGCCTGAGT 
3 01 GGCCTGGCAG CCGGCTTTGC CATCGTCATC GTGGGGGACA CTGGCAAGTG 

3 51 TGGCACTGCC CAGCAGCCCC GACTATTTGT AGGCATGATA CTGATCCTCA 

4 01 TCTTTGCCAA GGTGCTCATT CTCTCCACAA AGCAGCCCCT CTCAAAACCC 

4 51 ACCAGTCACA GAATACGATG TAAAGACCAC CCCTCCTCAT TCCGGAACAA 

5 01 ACAGCCTGAC ACGCATGTGC TGGGCAGCTG GCCCTCAGTA GTTGATCTTC 
551 TAAGTGTACA GTGTCCTCGT GTTCATCGTC TGTTGGCCAG GCCTTGCCCC 

6 01 CTCCCGCCCC ATGCTGTGGA CATCTGAACC TAC 



FEATURES: 

5'UTR: 1-9 

Start Codon: 10 

Stop Codon: 625 

3 T UTR : 628 
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HOMOLOGOUS PROTEINS: 

Top BLAST Hits: 

Score E 

|4502313 |ref | NP_001685 . 1 | ATPase, H+ transporting, lysosomal ... 181 5e-45 

| 137477 |sp|P23956|VATL_BOVIN VACUOLAR ATP SYNTHASE 16 KD PROT . . . 180 8e-45 

|227919|prf | |1713409A H ATPase 16K [Bos taurus] 180 le-44 

(3024812 | sp | 018882 | VATL_SHEEP VACUOLAR ATP SYNTHASE 16 KD PRO... 178 3e-44 

| 418179 |sp|Q03 105 jVATLJTORMA VACUOLAR ATP SYNTHASE 16 KD PROT... 175 5e-43 

| 6753144 | ref |NP_033859 .1 | ATPase-like vacuolar proton channel... 174 8e-43 

| 67954 |pir | | PXBOV6 H+- transporting ATPase (EC 3.6.1.35), vacu... 165 3e-40 

| 13 7478 |sp|P233 80 |VATL_DROME VACUOLAR ATP SYNTHASE 16 KDA PRO... 157 le-37 

j 3334403 |sp|O16110|VATL_AEDAE VACUOLAR ATP SYNTHASE 16 KD PRO... 156 2e-37 

| 1718095 |sp|P55277|VATL_HELVI VACUOLAR ATP SYNTHASE 16 KD PRO... 153 le-36 

j 401334 |sp|P31403|VATL__MANSE VACUOLAR ATP SYNTHASE 16 KD PROT... 153 le-36 

|l0442628|gb|AAG17394.l|AF277150_l (AF277150) V-ATPase 16 kD . . . 151 5e-36 

|7294725|gb|AAF50062.l| (AE003544) CG754 7 gene product [Droso... 151 7e-36 

j 2493142 [sp|Q26250|VATLJNEPNO VACUOLAR ATP SYNTHASE 16 KD PRO... 150 2e-35 

|251354 |gb| AAB22509.l| vacuolar H(+) -ATPase proteolipid subun. . . 150 2e-35 

|2493143|sp|Q00607|VATL_CANTR VACUOLAR ATP SYNTHASE 16 KD PRO... 144 9e~34 



BLAST to dbEST: 



9336427 /dataset=dbest /taxon=960.. 

6359805 /dataset=dbest /taxon=9606 

9134224 /dataset=dbest /taxon-9606. 
10219114 /dataset=dbest /taxon=96 . . 

9347217 /dataset=dbest /taxon=960.. 

9152104 /dataset=dbest /taxon=9606. 

9894156 /dataset=dbest /taxon=960.. 



Score 
165 
165 
165 
165 
165 
161 
155 



E 

le-38 
le-38 
le-38 
le-38 
le-38 
2e-37 
le-35 



EXPRESSION INFORMATION FOR MODULATORY USE: 

library source: 

Expression information from BLAST dbEST hits: 





9336427 


Human 


uterus 


gi 


6359805 


Human 


fetal liver 


gi 


9134224 


Human 


brain 


gi 


10219114 


Human 


lung 


gi 


9347217 


Human placenta 


gi 


9152104 


Human 


skin 


gi 


9894156 


Human 


ovary 



Expression information from PCR-based tissue screening panels 

Human Bone marrow 

Human Brain 

Human Colon 

Human Fetal Brain 

Human fetal heart 

Human Fetal Kidney 

Human fetal liver 

Human Heart 

Human Kidney 

Human Liver 

Human Lung 

Human Pancreas 

Human Placenta 

Human Prostate 

Human Skeletal Muscle 

Human Small Intestine 

Human Spleen 

Human Testis 
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1 MSNNSPEYAL VFTISGAMAT MVSSGLGAAC GMAKNGTGIM AMSVMWPELI 

51 HMKSIIPWM AGIITIYGLV AAVPPANSLN DDNSLYSSFL QLGAGLSGLA 

101 AGFAIVIVGD TGKCGTAQQP RLFVGMILIL IFAKVLILST KQPLSKPTSH 

151 RIRCKDHPSS FRNKQPDTHV LGSWPSWDL LSVQCPRVHR LLARPCPLPP 

2 01 HAVDI 



FEATURES : 

Functional domains and key regions: 

[1] PDOC00001 PSO0O01 ASN__GLYCOSYLATIONN-glycosylat ion site 



[2] PDOC00005 PS00005 PKC_PHOSPHO_S ITS Protein kinase C phosphorylation site 
Number of matches: 4 

1 111-113 TGK 

2 139-141 STK 

3 149-151 SHR 

4 160-162 SFR 



[3] PDOC00006 PS00006 CK2_PHOSPHO_SITECasein kinase II phosphorylation site 
Number of matches : 2 

1 78-81 SLND 

2 176-179 SWD 



[4] PDOC00008 PS00008 MYRISTYLN-myristoylation site 
Number of matches : 8 



1 


16-21 


GAMATM 


2 


25-30 


GLGAAC 


3 


27-32 


GAACGM 


4 


31-36 


GMAKNG 


5 


68-73 


GLVAAV 


6 


93-98 


GAGLSG 


7 


98-103 


GLAAGF 


8 


172-177 


GSWPSV 



35-38 NGTG 
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Membrane spanning structure and domains: 

Helix Begin End Score Certainty 

1 14 34 1.889 Certain 

2 37 57 0.733 Putative 

3 60 80 2.030 Certain 

4 95 115 1.775 Certain 

5 127 147 1.699 Certain 
BLAST Alignment to Top Hit: 

>gi | 4502313 | ref | NP_001685 . 1 | ATPase, H+ transporting, lysosomal 
(vacuolar proton pump) 16kD 

>gi | 137479 | sp| P27449 |VATL_HUMAN VACUOLAR ATP SYNTHASE 16 

KD PR0TE0L1P1D SUBUNIT >gi | 107394 | pir | |A39367 

H+- transporting ATPase (EC 3.6.1.35) chain PKD1 - human 

>gi|l89676|gb|AAA60039.l| (M62762) vacuolar H+ ATPase 

proton channel subunit [Homo sapiens] 

Length = 155 

Score = 181 bits (455) , Expect = 5e-45 

Identities = 110/153 (71%), Positives = 114/153 (73%), Gaps = 14/153 (9%) 

SNNSPEYALVFTISGAMATMVSSGLGAACGMAKNGTGIMAMSV™ 61 
S + PEYA F + GA A MV S LGAA G AK+GTGI AMSVM PE I MKSIIPWMA 
SKSGPEYASFFAVMGASAAMVFSALGAAYGTAKSGTGIAAMSVMRPEQI -MKSIIPWMA 62 

GI I TI YGLVAAVPPANSLNDDNSLYSS FLQLGA GLSGLAAGFAIVI VGDTGKCGTA 117 

Gil IYGLV AV ANSLNDD SLY S FLQLGA GLSGLAAGFAI IVGD G GTA 
GI IAI YGLWAVLIANSLNDDISLYKSFLQLGAGLSVGLSGLAAGFAIGIVGDAGVRGTA 12 2 



Query: 


2 


Sbjct : 


4 


Query: 


62 


Sbjct : 


63 


Query: 


118 


Sbjct : 


123 



QQPRLFVGMI LILIFA+V LILSTK 



Hmmer search results (Pfam) : 

Model Description Score E-value N 

PF00137 ATP synthase subunit C 14.8 0.028 2 

Parsed for domains : 

Model Domain seq-f seq-t hmm-f hmm-t score S-value 

PF00137 1/2 12 73 . . 1 65 [. 7.6 2.4 

PF00137 2/2 89 133 . . 1 53 [. 14.6 0.031 
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1 GCTGTGGGGC CAGGAAAAGG AGAGAAGGTG AAACCCCCGT CAGTCCCTCA 
51 CAATCAGCAC GTGGAAATCT AGAAATG C AG GAGAGGCCTG GACTCATGGT 
101 GGAATCCAGA ATGAAAGAGG TGGACGACTG AATGAGCAGA AGGAGGCAAG 
151 CACCAGAGGC TTGGGGGTCA CATTTCTTGG AAGTGGCCTG GAGCTGGCAG 

2 01 ATGAGAACTC TGGCTACCCG TCCTCATTCC AC TAACAGTA GCTCCTCTAA 
251 CGACATGCCC CTTCCCTCTG TACCCCGCTC CGCATGCGGC AAGTAGTTCC 

3 01 CGGACGCGAC CCTTCCCCCT GTACCCCGCT CCGCATGGGG CCAGTAGTTC 

3 51 CCGGACGCGC CCCTTCCCTC TGTACCCCGC TCCGCATGCG GCAAGTAGTT 

4 01 CCCGGACGCG CCCCTTCCCT CTGTACCCGG CTCCGCATGC GGAAAGTAGT 
4 51 TCCTACGGTG TTGGTTTTGC ATGTAGATGA AACCCTTTGA GGGGTAAAGG 
501 TTTTTTTTTT AAGTACTTTA GCAAATGCAA ACTGTTATTA TCAATATTAG 
551 CCAGCATCTT TTTTTTTTTT TTTTTTTTTT TTTTTGAGAT GGAGTTTCGC 
601 TCTTGTCACC CAGGCTGGAG TGCAATGGCA AAATCTAGGC TCACTGCAAC 
651 CTCCGCCTCC CAAGTTCAAG CGATTCTCCT GCCTCAGCCT CCCAGGTAGC 
701 TGGGATTACA GGCGTGTGCA ACCACACCCA GCTAATTTTT GTATTTTTAG 
751 TAGAGACAGG GTTTCACCAT GTTGGCCAGG CTGGTCTCGA ACTCCTGACC 
801 TCATGTGATC CATCCGCCTC AGCCTCCCAA AGTGCTGGGA TTACGTAGCC 
851 AGTGTCTTTC TTAAGTGCCT GTCAAATAAT GCTCCTGGTT TATAAGTGCC 
9 01 CCTGGCTCTA CCTTCTGGGT GCTCAGACAC CAACACAGAG AGAACAGAAT 
951 TAACATCCTG AGAAGTTACA TATGCTAAAA TATAAAGAGT AAGATTGTGA 

1001 GGAAAGTGCA GGGGAAGCAG GTAGGTTAGG AAAAGGTATC CTCACTTTTC 
1051 TGCTGACCGA TGAGTCATAA TTCTTGAATT TCGGTGCTGG AAAGGTCCAT 
1101 TAAGCATTCC AGGAGATTCT AGGGAGCTTC CAGAATGGTA GAAGAACTGG 
1151 AACCATAAAG CCTGGGGAAG GGATGGAAGT CCTTGGGAAA GAAGCACTAA 
1201 ACAGCCCAGT GGAGACAAGG AAGGACTGGT CTCTCCTGTG CTTCGAGCCC 
1251 AGCAATGATT ATTCACTCAG ATATGCCCCG GCAGGTCCTG CTGCTAGAGC 
13 01 CAGTGCTGTT CCCAGACCCA GGCAAGGTGC CATCCTACCC CTGACAGGAA 

13 51 ACAGGGCAGG AGGTGGGGCT GCCCCGGGTG CCTGGTGTTG GGAGGGGCCG 

14 01 GGGGGAATCC CGGGTGTTGG GAGGACAAGG CAGAGTCAGC TAGCTGTGAG 
1451 GCTAGGGGAG AAGACCTCTC TAGTCTGGGA GAGACCCCTC CTTTCCTAGC 
1501 TCCTTGTACT TCCAAAAAGG CAGGCTTCCT GCTGTTACTA ACCATACCAG 

15 51 GACTGACTAT ACAGCAGCCA GAAAGATTCT GAGAAACCTG TGATAGAGAA 

16 01 AAACAGATGC GGAAGCGGGA GAAGAGAATT TCATAGGACA CTAGGGAAAG 
1651 AGAATGGGAA CTTGTGGTCT AAAGAGGGAA CCAAGTCTGG CCAACATGGT 
1701 GAAACCCCAT CTCTATTACA AATACAAAAA TTAGCTGGGC ATGGTAGTGC 
1751 ATGCCTGTAA TCCCAGCTAC TCAGGAGGGT AAGGCATGAG AATCACTTGA 
1801 GACTGGGAGG CGGAGGTTGC AGTGAGCCGA GACTGCACCA CTGCACTCCA 
1851 GCCTGGGCAA CAGAGCAAGA CCTCGTCTCA AAAAAAAAAA AAAATTAAAT 
1901 TTAAATTAAA AAAAAATAAA CAGGGAACCA ACAAGAGCTG GCAGAACAGA 
19 51 ATAAAGTCTC AAGCCAAATA ACTCCCTTGC CTTGGAAGAA CAAGGCTGCC 
2001 AGCGTCTTGG AGCCTCTGTT TATCGGGTAC CAGTTCAAAG GACAGTGAGC 
2051 CTGAGCTGGC CTGGGAGGCC CTCCCCCCTC CCAGATGAAA ACAATAGGCC 
2101 TGTTTTCCTG AGCTCTTCCT GTAATCCAGA AGGCCCACAC AGAGAGGAAG 
2151 AGGGGGGCAA AGGCAGTGGC TATACCCAGT GGGGGAGGGG ATATTTAGCC 
2201 TCCCATAAAT TCATCAGCTC CCTTAAAGAC ACCCCAAAAC ACCAACAATC 
2251 TAAGTGTTAA AATAGTGACT GCTATGCAAA TGGAGCTTTA AAACCTATCC 
23 01 CTTAGCCCCA GT AC C AC C AG AT T ACTAACC CTAAACCCCA TCTGTAGGAG 

23 51 ATATTCTGAA GCCACCACAG GGGAAGGGAT AAGGGCCTGA GAGACAAAGG 

24 01 ACAGATGGGG TCTCCCCAAC AATTTAAGTT AAGTTCCACA AGGATACAGT 

24 51 ACTGGCAGAG ATTTGGAAGT AGGGGCAAGT ATT CTGAC AG AAGGGTGGTG 

25 01 TCTTAGGCAC CCTTCAATTA GGAGTAGCTA AAGGCTGTGT GTGTGTCTGT 
2551 GTGTGTGCAT AAGAAAAGAA ATAGGAGGGT GTGTGTGTGG TAAGAAAGAG 
2601 CATCTTGGCT GGGCGCGGTG GCTCACCCCT ATAATCGCAG CACTTTGGGA 
2651 TGCCAAGGCT GGCGGATTGC CTGAGCTCAG GAGTTTGAGA CCATACGGGG 
2701 CAACATGGTG AAACCCCATC TCTACTAAAA ATACAAAAAA TTAGCTGGGC 
2751 ATGGTGGTGC GTGCCTATAG TTCCAGCTAC TCGGGAGGCT GAGGCATGAG 
2801 AATGGCTTGA GCCCTGGAGG CAGAGGTTGA AGTGAGCTGA GATCGCACCA 
2851 TTGCATTCCA GCTTGGGCTA CAGAGTGACA CTCCATCTCA AAAAAAAAAA 
2901 AAAAAAAAAA AAAAACCAGC ATCTTTGCTG CCACTAGTCC ACTGTCTTTG 
2951 CACTCACTCT CTGCCATGCC CATCCTTGTC CCCCTCCCCA CTCACAGACA 
30 01 TGTCCAACAA CAGCCCCGAG TATGCTTTGG TTTTCACCAT CTCGGGTGCT 
3 0 51 ATGGCCACCA TGGTCTCCAG TGGCCTGGGT GCTGCCTGTG GCATGGCCAA 
3101 GAATGGCACC GGCATCATGG CCATGTCTGT CATGTGGCCA GAGCTGATCC 
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3151 ACATGAAGTC CATCATCCCA GTGGTCATGG CTGGTATCAT C AC CAT C TAT 
3201 GGCCTAGTGG CGGCTGTCCC CCCTGCCAAC TCCCTGAATG ATGACAACAG 
3251 TCTCTATAGC AGTTTCCTCC AGCTGGGCGC TGGCCTGAGT GGCCTGGCAG 
3301 CCGGCTTTGC CATCGTCATC GTGGGGGACA CTGGCAAGTG TGGCACTGCC 
3 351 CAGCAGCCCC GACTATTTGT AGGCATGATA CTGATCCTCA TCTTTGCCAA 
34 01 GGTGCTCATT CTCTCCACAA AGCAGCCCCT CTCAAAACCC ACCAGTCACA 
34 51 GAATACGATG TAAAGACCAC CCCTCCTCAT TCCGGAACAA ACAGCCTGAC 
3501 ACGCATGTGC TGGGCAGCTG GCCCTCAGTA GTTGATCTTC TAAGTGTACA 
3 551 GTGTCCTCGT GTTCATCGTC TGTTGGCCAG GCCTTGCCCC CTCCCGCCCC 
3601 ATGCTGTGGA CATCTGAACC TACTCATCAC CCATCCAGGT CCCCGACCAG 
3651 TGAGGACTCA GGCCCCTGGA TGCCCCACCC ATCTCCCTTG AGTACTCTAT 
3 701 GTATAAGGAT GAATTAGAGT TGTCATTTTC TCTTCATTAG ATATTTATAA 
3 751 AGATTTGGCC TGTCCATACC CCTGTGGAGC AGCCCTCATC TCCCACCTAT 
3 801 CTGTCACGTC ATGGAGGTTC CCATTGCGGA GGCTCCTTGG ATGGAACCAC 
3 851 CCTCTCCAGC CCGCGCTGCC AGGCCCTGTG CGGCAGCTGT GTCTGATAAA 
3 901 GTTCTCAGAT GTCGGGGGAG GGAAAGAAAA AAAAAAGAGA GTGTGAGTAC 

3 951 GTAAGAGAGA GAAAACGGGA GTGGGTGTGT GAGCTGGAGA CAGGGAAGTG 

4 001 GCAGGAAAAG TCTGATAAGA TCACCTCCTT CCTACCCAAG CAGAGATACT 
4051 GGACACAGCC CCTCAAGGAC CCAGAGGGTA AGTAGAGCGC GAGATGCTTG 
4101 CCTTTCTCAA TGGGAGGTGG CCTCCCAGGC CTGAAGAAGT CTCCATTTAC 
4151 CCCAGAGCCA ACTAGGAAGC AGGTAGACAG CATCATCCCC ACTTATACCC 
42 01 CAAGGTGCTT GGGGTGAATG GCAGGCCCAA AGCCAAAGCA TGAGACAGAT 
42 51 TAAATGTTCC TATGGCGAGA GAAGGAGAAG GGGTCAC C AG CATCTCTCCA 
4301 CTGAGCAAAT GAAAGGAAGA GAGAAGGCAG GCTGATACCC TCATCAATTT 
4 3 51 CCTACTGTCC ATGATATACC ACCATCAACT GGACTTTTTT TTTTTTTTTG 
44 01 AGATAGAGTC TCGCTTTTGT CACCCAGGCT GGAGTGCAGT GGCATGATCT 
44 51 CAGCTCACTG CAACTTCCAT CTCCCAGGTT CAAGTGATTC TCCCGCCTCA 
4501 GCCTCCTGAG TAGCTGGGAT TACAGGTGCC TGCTACCACA TCCAGCTGAT 
4551 TTTTTTTGTA TTTTTAGTAG AGATGGGGTT TCTTTCTTTT TTTTTTTTTT 
4601 TTTTGAGACG GAGTCTTGCT CTGTCGCCCA GGCTGGAGTG CAGTGGCGCG 
4651 ATCTCGGCTC ACTGCAACCT CCGCCTCCCA GGTTCACGCC ATTCTCCTGC 
4701 CTCAGCCTCC CGAGTAGCTG GGACTACAGG CACCTGCCAC CACACTCGGC 
4 751 TAATTTTTTG TATGTTTAGT AGATATGGGG TTTCACTGCT GTCTCAACCT 
4801 TCTGACCTCA TGATCCGCCC GCCTCGGCCT CCCAAAGTGC TGGGATTACA 
4851 GGCATGAGCC ACTGTGCCCG GCCTTTTTTT TTTTTTTTGA GATGGAGTCT 
4901 CGCTCTGTCG CCCAGGCTGG AGTGCAATGC CACAATCTCA GCTCACTGCA 

4 951 AGCTCCACCT CCGAGGTTCA CGCCATTCTC CTGCCTCAGC CTCCTGAGTA 
5001 GCTGGGACTA CAGGCGCCCG CCACCACGCC CAGCTAATTT TTTGTATTTT 
5051 TAGTAGAGAC GGGGTTTCAC CTTGTTAGCC AGGATGGTCT TGATCTCCTG 
5101 ACCTCGTGAT CCACCTGCCT CAGCCTCCCA AAGTGCTGGG ATTACAGGTG 
5151 TGAGCCACCA TGCCTGGCCT TTTTTTTTTT TTTAAGACAG GAGTGTGGTG 
5201 GCACAATCTC AGCTCACTGC AACCTCCCCT TCTAGGTTCA AGCAATTCTC 
5251 CTGCCTCAGC TTCCTAAGTA TAGTAATAGC TGGGACTATA GGCGCCCACC 
5301 ACCACGCCCG GCTAATCTTT TGTATTTTTA GTAGAGATGG GGTTTCACCA 
53 51 TGTTGGCCAG GCTGGTCTCG AATTGCTGAC CTCAAGTGAT CTGCCCACCT 
5401 GGGCCTCCCA AAGTGCTGGG ACTATAGGCG GGAGCCACCG CGCCCAGCCT 
5451 GGACTCTTTT TAATGAAGCC TTCAAAAAAA CTCCTTTTCT CAGCGCTTCT 
55 01 TACTCTCTGA AACAGACTCT CCACTCTGCT AACCCTGCCT CTCACACTGT 

5 551 GGAACTCAAC CGGATCTTTT TATTCTGAAT CCACAACGTG AAGTACTTGT 
5601 CCTCTGTCTA TCGATGGCTA CCTGTGTTTT GAAGTGTTTT TATGGGAATG 
5651 AAGCACTGGA GGGGAGGAAA TCAGGCCAGT TCTAGAAGTA GAAGGAAGGC 
5701 GAAGAAACCA GGAAAAATAT TTATGTGATG GGAGGAAAGG CAGTTTATAA 
5751 ATCACTCATG GATCTCTATG CCAGAGGGAT GTGTGAGACA CACGCATGCA 
5801 CACACACACT GACTTGCAGG TACATGCAGA GGCAGAAACA AGTCAGGACA 
5851 TGACACATAC ATGAATACAC ATACCATTCT CATCAGAAAC CAGTCAGAGC 
5901 AGAGGGGCCC TGCCTGGAGC AAGGAGACTG GAATTTATTC CCCTCCTCCT 
5951 CTCAAAGGGT AATTTTGCTG CCTCCATGTC TAGGTTCCCC ACAGATCTGG 
6001 CTGCCTCAGA CAGGGGCCCT GGTCTGGTGG CTGGACTCAG CCTGGAGGTC 
6051 TTCACAGATG GAGGCCTATA AGAGGTGGCA GCTGACACCT GGAGGGAGCT 
6101 GGATGAAAGC AGGCAGTGCA GAGTAGAGAA AGCCAGGTGG TGGGGGAGGG 
6151 AGTGAGGGAG AAGAGGGGAC CAGATTCAAG CAGCCTTGCG CTGGTTCTAA 
6201 AATGGCCACA GCAAGGCAAC GGACAGATGG TCCCTTTCTG ATGCTGAGCC 
62 51 GGGGAAGTGG GGAAAGGGAA AAGGAAAAAA TAAACACCAT CACAGTCAGA 
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63 01 AATTTAAAAA TAAACTGAAA AACCTAAAAA ATAAACCGT 



FEATURES : 

Start: 3000 
Exon: 3000-3614 
Stop: 3615 



CHROMOSOME MAP POSITION: 

Bac accession number: AC005973 
Chromosome 17 



ALLELIC VARIANTS 

DNA 

Position Major 



(SNPS) : 



Minor 



559 

3638 

5446 

5808 

5892 

6071 



G 
C 
A 
A 
A 



A 
C 
T 
G 
C 
G 



Domain 



Beyond ORF ( 5 ' ) 
Beyond ORF (3 ' } 
Beyond ORF (3 1 ) 
Beyond ORF (3 ' ) 
Beyond ORF (3') 
Beyond ORF (3 • ) 



Context : 
DNA 

Position 

559 CCCTTCCCTCTGTACCCCGCTCCGCATGCGGCAAGTAGTTCCCGGACGCGACCCTTCCCC 
CTGTACCCCGCTCCGCATGGGGCCAGTAGTTCCCGGACGCGCCCCTTCCCTCTGTACCCC 
GCTCCGCATGCGGCAAGTAGTTCCCGGACGCGCCCCTTCCCTCTGTACCCGGCTCCGCAT 
GCGGAAAGTAGTTCCTACGGTGTTGGTTTTGCATGTAGATGAAACCCTTTGAGGGGTAAA 
GGTTTTTTTTTTAAGTACTTTAGCAAATGCAAACTGTTATTATCAATATTAGCCAGCATC 
[-,A,T] 

TTTTTTTTTTTTTTTTTTTTTTTTTTGAGATGGAGTTTCGCTCTTGTCACCCAGGCTGGA 
GTGCAATGGCAAAATCTAGGCTCACTGCAACCTCCGCCTCCCAAGTTCAAGCGATTCTCC 
TGCCTCAGCCTCCCAGGTAGCTGGGATTACAGGCGTGTGCAACCACACCCAGCTAATTTT 
TGTATTTTTAGTAGAGACAGGGTTTCACCATGTTGGCCAGGCTGGTCTCGAACTCCTGAC 
CTCATGTGATCCATCCGCCTCAGCCTCCCAAAGTGCTGGGATTACGTAGCCAGTGTCTTT 



3638 GTGTGGCACTGCCCAGCAGCCCCGACTATTTGTAGGCATGATACTGATCCTCATCTTTGC 
CAAGGTGCTCATTCTCTCCACAAAGCAGCCCCTCTCAAAACCCACCAGTCACAGAATACG 
ATGTAAAGACCACCCCTCCTCATTCCGGAACAAACAGCCTGACACGCATGTGCTGGGCAG 
CTGGCCCTCAGTAGTTGATCTTCTAAGTGTACAGTGTCCTCGTGTTCATCGTCTGTTGGC 
CAGGCCTTGCCCCCTCCCGCCCCATGCTGTGGACATCTGAACCTACTCATCACCCATCCA 
[G,C] 

GTCCCCGACCAGTGAGGACTCAGGCCCCTGGATGCCCCACCCATCTCCCTTGAGTACTCT 
ATGTAT AAGGAT GAAT TAGAGTT GT C AT TTTCT CTT CATT AGATAT TTAT AAAGATT TGG 
CCTGTCCATACCCCTGTGGAGCAGCCCTCATCTCCCACCTATCTGTCACGTCATGGAGGT 
TCCCATTGCGGAGGCTCCTTGGATGGAACCACCCTCTCCAGCCCGCGCTGCCAGGCCCTG 
TGCGGCAGCTGTGTCTGATAAAGTTCTCAGATGTCGGGGGAGGGAAAGAAAAAAAAAAGA 



544 6 AGGTGTGAGCCACCATGCCTGGCCTTTTTTTTTTTTTTAAGACAGGAGTGTGGTGGCACA 
ATCTCAGCTCACTGCAACCTCCCCTTCTAGGTTCAAGCAATTCTCCTGCCTCAGCTTCCT 
AAGTATAGTAATAGCTGGGACTATAGGCGCCCACCACCACGCCCGGCTAATCTTTTGTAT 
TTTTAGTAGAGATGGGGTTTCACCATGTTGGCCAGGCTGGTCTCGAATTGCTGACCTCAA 
GTGATCTGCCCACCTGGGCCTCCCAAAGTGCTGGGACTATAGGCGGGAGCCACCGCGCCC 
[C,T,A] 

GCCTGGACTCTTTTTAATGAAGCCTTCAAAAAAACTCCTTTTCTCAGCGCTTCTTACTCT 
CTGAAACAGACTCTCCACTCTGCTAACCCTGCCTCTCACACTGTGGAACTCAACCGGATC 
TTTTTATTCTGAATCCACAACGTGAAGTACTTGTCCTCTGTCTATCGATGGCTACCTGTG 
TTTTGAAGTGTTTTTATGGGAATGAAGCACTGGAGGGGAGGAAATCAGGCCAGTTCTAGA 
AGTAGAAGGAAGGCGAAGAAACCAGGAAAAATATTTATGTGATGGGAGGAAAGGCAGTTT 
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5808 TGAAACAGACTCTCCACTCTGCTAACCCTGCCTCTCACACTGTGGAACTCAACCGGATCT 
TTTTATTCTGAATCCACAACGTGAAGTACTTGTCCTCTGTCTATCGATGGCTACCTGTGT 
TTTGAAGTGTTTTTATGGGAATGAAGCACTGGAGGGGAGGAAATCAGGCCAGTTCTAGAA 
GTAGAAGGAAGGCGAAGAAACCAGGAAAAATATTTATGTGATGGGAGGAAAGGCAGTTTA 
TAAATCACTCATGGATCTCTATGCCAGAGGGATGTGTGAGACACACGCATGCACACACAC 
[A, G] 

CTGACTTGCAGGTACATGCAGAGGCAGAAACAAGTCAGGACATGACACATACATGAATAC 
ACATACCATTCTCATCAGAAACCAGTCAGAGCAGAGGGGCCCTGCCTGGAGCAAGGAGAC 
TGGAATTTATT CC C C T C CT C C T CT CAAAGGGTAATT TTG CTGC CT C C ATGT CTAGGT T CC 
CCACAGATCTGGCTGCCTCAGACAGGGGCCCTGGTCTGGTGGCTGGACTCAGCCTGGAGG 
TCTTCACAGATGGAGGCCTATAAGAGGTGGCAGCTGACACCTGGAGGGAGCTGGATGAAA 

5892 AGTACTTGTCCTCTGTCTATCGATGGCTACCTGTGTTTTGAAGTGTTTTTATGGGAATGA 
AGCACTGGAGGGGAGGAAATCAGGCCAGTTCTAGAAGTAGAAGGAAGGCGAAGAAACCAG 
GAAAAATATTTATGTGATGGGAGGAAAGGCAGTTTATAAATCACTCATGGATCTCTATGC 
CAGAGGGATGTGTGAGACACACGCATGCACACACACACTGACTTGCAGGTACATGCAGAG 
GCAGAAACAAGTCAGGACATGACACATACATGAATACACATACCATTCTCATCAGAAACC 
[A,C] 

GTCAGAGCAGAGGGGCCCTGCCTGGAGCAAGGAGACTGGAATTTATTCCCCTCCTCCTCT 
CAAAGGGTAATTTTGCTGCCTCCATGTCTAGGTTCCCCACAGATCTGGCTGCCTCAGACA 
GGGGCCCTGGTCTGGTGGCTGGACTCAGCCTGGAGGTCTTCACAGATGGAGGCCTATAAG 
AGGTGGCAGCTGACACCTGGAGGGAGCTGGATGAAAGCAGGCAGTGCAGAGTAGAGAAAG 
CCAGGTGGTGGGGGAGGGAGTGAGGGAGAAGAGGGGACCAGATTCAAGCAGCCTTGCGCT 

6071 CCAGAGGGATGTGTGAGACACACGCATGCACACACACACTGACTTGCAGGTACATGCAGA 
GGCAGAAACAAGTC AGGACATGAC AC AT ACATGAAT AC AC AT AC CAT TCT CAT C AGAAAC 
CAGTCAGAGCAGAGGGGCCCTGCCTGGAGCAAGGAGACTGGAATTTATTCCCCTCCTCCT 
CTCAAAGGGTAATTTTGCTGCCTCCATGTCTAGGTTCCCCACAGATCTGGCTGCCTCAGA 
CAGGGGCCCTGGTCTGGTGGCTGGACTCAGCCTGGAGGTCTTCACAGATGGAGGCCTATA 
[A,G3 

GAGGTGGCAGCTGACACCTGGAGGGAGCTGGATGAAAGCAGGCAGTGCAGAGTAGAGAAA 
GCCAGGTGGTGGGGGAGGGAGTGAGGGAGAAGAGGGGACCAGATTCAAGCAGCCTTGCGC 
TGGTTCTAAAATGGCCACAGCAAGGCAACGGACAGATGGTCCCTTTCTGATGCTGAGCCG 
GGGAAGTGGGGAAAGGGAAAAGGAAAAAATAAACACCATCACAGTCAGAAATTTAAAAAT 
AAACTGAAAAACCTAAAAAATAAACCGT 
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